Evaluating Language Technologies: The MULTIDOC Approach to Taming the Knowledge Soup
Abstract
In this paper we report on ongoing verification and validation work within the MULTIDOC project, which is situated in the field of multilingual automotive product documentation. One central task is the evaluation of existing off-the-shelf and research-based language technology (LT) products and components for the purpose of supporting or even reorganising the documentation production chain along three diagnostic dimensions: the process proper, the documentation quality, and the translatability of the process output. In this application scenario, LT components are to control and ensure that predefined quality criteria can be applied to, and measured on, the documentation end-product as well as the information objects that form its basic building blocks. Multilinguality is of crucial importance here: it is to be introduced or prepared, and then maintained, as early as possible in the documentation workflow to ensure a better and faster translation process. A prerequisite for the evaluation process is a thorough definition of these dimensions in terms of user quality requirements and LT developer quality requirements. In our approach, we define the output quality of the whole documentation process as the pivot where user requirements and developer requirements meet. A so-called "braided" diagnostic evaluation turned out to be very well suited to covering both views. Since no generally approved standards, or even valid specifications for standards, exist for the evaluation of LT products, we have adapted existing standards for the evaluation of software products, in particular ISO 9001, ISO 9000-3, ISO/IEC 12119, ISO 9004 and ISO 9126. This is feasible because an LT product consists of a software part and a lingware part; the adaptation had to be accomplished for the latter.